Working Paper 34 UNITED NATIONS ECONOMIC COMMISSION FOR EUROPE CONFERENCE OF EUROPEAN STATISTICIANS

Author

  • Masayoshi Takahashi
Abstract

Missing data problems are ubiquitous in many fields, including official statistics, where one of the common treatments of missing data is ratio imputation (de Waal et al., 2011; Thompson & Washington, 2012; Office for National Statistics, 2014). Statisticians, on the other hand, have long recommended multiple imputation (Rubin, 1987; Little & Rubin, 2002), which is widely regarded as the gold standard for treating missing data (Baraldi & Enders, 2010; Cheema, 2014). Although ratio imputation is often employed to deal with missing values in practice, the literature contains no treatment of multiple ratio imputation, which leaves a gap between theory and practice. This paper proposes a novel application of the Expectation-Maximization with Bootstrapping (EMB) algorithm to ratio imputation, in which multiply-imputed values are created for each missing value. The objective of the paper is to present the mechanism of multiple ratio imputation and to assess its performance against traditional imputation methods. For this purpose, Monte Carlo simulation is applied to a newly developed R function for multiple ratio imputation. A small application to the 2012 Japanese Economic Census data is also presented to illustrate the usefulness of multiple ratio imputation. The method is implemented with the EMB algorithm in the R statistical environment (to be released soon).
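To make the idea of creating several imputed values per missing value concrete, the following is a minimal Python sketch, not the author's R function. It rests on several assumptions that are not in the abstract: the ratio model is taken to be y = beta * x with x fully observed and positive, the EM step of the EMB algorithm is replaced by a plain ratio estimate computed on each bootstrap resample, and no random disturbance term is added to the imputed values. The names `ratio_estimate` and `multiple_ratio_imputation` are hypothetical.

```python
import random


def ratio_estimate(pairs):
    """Return sum(y) / sum(x) over the pairs whose y is observed (not None)."""
    observed = [(x, y) for x, y in pairs if y is not None]
    total_x = sum(x for x, _ in observed)
    total_y = sum(y for _, y in observed)
    return total_y / total_x


def multiple_ratio_imputation(pairs, m=5, seed=0):
    """Create m completed copies of `pairs`, each imputed with its own ratio.

    Assumption: each ratio comes from a bootstrap resample of the rows.
    This stands in for the EM step of EMB and is a simplification.
    """
    rng = random.Random(seed)
    n = len(pairs)
    completed = []
    for _ in range(m):
        # Redraw until the resample has observed y values with a positive x total.
        while True:
            boot = [pairs[rng.randrange(n)] for _ in range(n)]
            if sum(x for x, y in boot if y is not None) > 0:
                break
        beta = ratio_estimate(boot)
        # Keep observed y values; fill missing ones with beta * x.
        completed.append(
            [(x, y if y is not None else beta * x) for x, y in pairs]
        )
    return completed


if __name__ == "__main__":
    # Illustrative data only: (x, y) pairs, with None marking a missing y.
    data = [(10, 21), (20, 39), (30, None), (40, 83), (50, None)]
    for i, dataset in enumerate(multiple_ratio_imputation(data, m=3), start=1):
        mean_y = sum(y for _, y in dataset) / len(dataset)
        print(f"imputed dataset {i}: mean of y = {mean_y:.2f}")
```

Because each completed dataset uses a ratio estimated from a different resample, the spread of an estimate across the m datasets gives a rough indication of the uncertainty due to imputation, which a single ratio imputation cannot show.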


Similar resources

Working Paper No. 30 ENGLISH ONLY UNITED NATIONS STATISTICAL COMMISSION and ECONOMIC COMMISSION FOR EUROPE CONFERENCE OF EUROPEAN STATISTICIANS EUROPEAN COMMISSION STATISTICAL OFFICE OF THE EUROPEAN COMMUNITIES (EUROSTAT)

In this paper we give an overview of various approaches to the implementation of statistical disclosure control to tabular data released through the Web. We consider three generic groups of statistical disclosure control methods: source data perturbation, output perturbation and query-set restriction. Considering different types of Web-sites and implementation approaches we discuss the appropri...


WP. 9 ENGLISH ONLY UNITED NATIONS STATISTICAL COMMISSION and ECONOMIC COMMISSION FOR EUROPE CONFERENCE OF EUROPEAN STATISTICIANS EUROPEAN COMMISSION STATISTICAL OFFICE OF THE EUROPEAN COMMUNITIES (EUROSTAT)

The concept of differential privacy has received considerable attention in the literature recently. In this paper we evaluate the masking mechanism based on Laplace noise addition to satisfy differential privacy. The results of this study indicate that the Laplace based noise addition procedure does not satisfy the requirements of differential privacy.



Publication date: 2015